Spectral clustering of protein sequences
نویسندگان
چکیده
منابع مشابه
Spectral clustering of protein sequences
An important problem in genomics is automatically clustering homologous proteins when only sequence information is available. Most methods for clustering proteins are local, and are based on simply thresholding a measure related to sequence distance. We first show how locality limits the performance of such methods by analysing the distribution of distances between protein sequences. We then pr...
متن کاملSpectral Analysis of Protein Sequences
Wang, Zhi. Spectral Analysis of Protein Sequences. (Under the direction of Dr. William R. Atchley and Dr. Charles E. Smith.) The purpose of this research is to elucidate how to apply spectral analysis methods to understand the structure, function and evolution of protein sequences. In the first part of this research, spectral analyses have been applied to the basichelix-loop-helix (bHLH) family...
متن کاملClustering of Short Read Sequences for de novo Transcriptome Assembly
Given the importance of transcriptome analysis in various biological studies and considering thevast amount of whole transcriptome sequencing data, it seems necessary to develop analgorithm to assemble transcriptome data. In this study we propose an algorithm fortranscriptome assembly in the absence of a reference genome. First, the contiguous sequencesare generated using de Bruijn graph with d...
متن کاملTowards Automatic Clustering of Protein Sequences
Analyzing protein sequence data becomes increasingly important recently. Most previous work on this area has mainly focused on building classification models. In this paper, we investigate in the problem of automatic clustering of unlabeled protein sequences. As a widely recognized technique in statistics and computer science, clustering has been proven very useful in detecting unknown object c...
متن کاملProtein Function Prediction by Spectral Clustering of Protein Interaction Network
The increasing availability of large-scale protein-protein interaction (PPI) data has made it possible to understand the basic components and organization of cell machinery from the network level. Many studies have shown that clustering protein interaction network (PIN) is an effective approach for identifying protein complexes or functional modules. A significant number of proteins in such PIN...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Nucleic Acids Research
سال: 2006
ISSN: 0305-1048,1362-4962
DOI: 10.1093/nar/gkj515